On integral points on surfaces 



P. Corvaja U. Zannier 

Abstract. We study the integral points on surfaces by means of a new method, relying on the Schmidt 
Subspace Theorem. This method was recently introduced in [CZ] for the case of curves, leading to a new 
proof of Siegel's celebrated theorem. Here, under certain conditions involving the intersection matrix of the 
divisors at infinity, we shall conclude that the integral points on a surface all lie on a curve. We shall also 
give several examples and applications. One of them concerns curves, with a study of the integral points 
defined over a variable quadratic field; for instance we shall show that an affine curve with at least five points 
at infinity has at most finitely many such integral points. 

§0. Introduction and statements. In the recent paper [CZ] a new method was introduced in connection 
with the integral points on an algebraic curve; this led to a novel proof of Siegel's celebrated theorem, based on 
the Schmidt Subspace Theorem and entirely avoiding any recourse to abelian varieties and their arithmetic. 
Apart from this methodological point, we observed (see the Remark in [CZ]) that the approach was sometimes 
capable of quantitative improvements on the classical one, and we also alluded to the possibility of extensions 
to higher dimensional varieties. The present paper represents precisely a first step in that direction, with an 
analysis of the case of surfaces. 

The crux of the arguments in [CZ] appeared through the special case of Siegel's Theorem when the 
affine curve misses at least three points with respect to its projective closure; this condition alone implies 
the finiteness of the set of integral points. That case was studied by embedding the curve in a space of large 
dimension and by constructing hyperplanes with high order contact with the curve at some point at infinity; 
finally one exploited the diophantine approximation through the Schmidt Theorem rather than through the 
Roth's one, as in the usual approach. Correspondingly, here we shall work with (nonsingular) affine surfaces 
missing at least four divisors; but now, unlike the case of curves, we shall need additional assumptions on 
the divisors, expressed through their intersection matrix. These conditions appear naturally when using the 
Riemann-Roch Theorem to embed the surface in a suitable space and to construct functions with zeros of 
large order along a prescribed divisor in the set, allowing an application of the Subspace Theorem. 

The result of this approach is the Main Theorem below. Its assumptions appear somewhat technical, 
so we have preferred to start with its corollary Theorem 1 below; this is sufficient for some applications, 
such as to Corollary 1, which concerns the quadratic integral points on a curve. As a kind of "test" for the 
Main Theorem, we shall see how it immediately implies Siegel's theorem on curves (Ex. 1.5). Still other 
applications of the method may be obtained looking at varieties defined in A m by one equation f\ ■ ■ ■ /,. = g, 
where fi,g are polynomials and deg<? is "small". (A special case arises with "norm form equations", treated 
by Schmidt in full generality; see [SI].) However in general the variety has singularities at infinity, so, even in 
the case of surfaces, the Main Theorem cannot be applied directly to such equation; this is why we postpone 
such analysis to a separate paper. 

In the sequel we let X denote a geometrically irreducible non-singular projective surface defined over 
a number field k. We also let S be a finite set of places of k, including the archimcdcan ones, denoting as 
usual Os = {a e k : \a\ v < 1 for all v £ S}. 

We view the S-integral points in the classical way; namely, letting X be an affine Zariski-open subset 
of X (defined over k), embedded in A m , say, we define an S'-integral point P £ X(Os) as a point whose 
coordinates lie in Os- For our purposes, this is equivalent with the more modern definitions given e.g. in 
[Sel] or [V]. 

Theorem 1. Let X be a surface as above, and let X a X be an affine open subset. Assume that X \ X = 
D\ U . . . U D r , where the Di are distinct irreducible divisors such that no three of them share a common point. 
Assume also that r > 4 and that there exist positive integers pi, . . . ,p r ,c, such that piPj(Di.Dj) — c for all 
pairs i, j . 

Then there exists a curve on X containing all the S-integral points in X{k). 

Below we shall note that one cannot remove the condition on the (Di.Dj) (see Ex. 1.1). 
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An application of Theorem 1 (which does not follow from the mentioned results by Vojta), concerns the 
points on a curve which are integral and defined over a field of degree at most 2 over k; we insist that here 
we do not view this field as being fixed, but varying with the point. This situation (actually for fields of any 
given degree in place of 2) has been studied in the context of rational points, via the former Mordell-Lang 
conjecture, now proved by Faltings; see e.g. [HSi, pp. 439-443] for an account of some results and several 
references. For instance, in the quadratic case it follows from rather general results by D. Abramovitch and 
J. Harris (see [HSi, Thms Ff 2f , F125(i)]) that if a curve has infinitely many points rational over a quadratic 
extension of k, then it admits a map of degree < 2 either to P 1 or to an elliptic curve. 

For integral points we may obtain without appealing to Mordell-Lang a result in the same vein, which 
however seems not to derive directly from the rational case, at least when the genus is < 2. (In fact, in that 
case, Mordell-Lang as applied in [HSi] gives no information at all.) This result will be proved by applying 
Theorem I to the symmetric product of a curve with itself. We state it as a corollary, where we use the 
terminology quadratic (over k) S -integral point to mean a point defined over a quadratic extension of k, 
which is integral at all places of Q except possibly those lying above S. 

Corollary 1. Let C be a geometrically irreducible projective curve and let C — C\{Ai, . . . , A r } be an affine 
subset, where the Ai are distinct points in C(k). Then 

(i) If ' r > 5, C contains only finitely many quadratic (over k) S-integral points. 

(ii) If r > 4, there exists a finite set of rational maps ip : C — > P 1 of degree 2 such that all but finitely 
many of the quadratic S-integral points on C are sent to P : (fc) by some of the mentioned maps. 

In the next section we shall see that the result is in a sense best-possible (see Ex. 1.2-1.3), and we shall 
briefly discuss possible extensions. We shall also state an "Addendum" which provides further information 
on the maps in (ii). 

As mentioned earlier, we have postponed the statement of our main result (which implies Theorem 1), 
because of its somewhat involved formulation. Here it is: 

Main Theorem. Let X be a surface as above, and let X C X be an affine open subset. Assume that 
X \ X = Di U . . . U D r , r > 2, where the Di are distinct irreducible divisors with the following properties: 

(i) - No three of the Di share a common point. 

(ii) - There exist positive integers pi,. .. ,p r such that, putting D := p\D\ + . . . + p r D r , D is ample and 
the following holds. Defining £j ; for i = l,...,r, as the minimal positive solution of the equation 
D 2 £ 2 — 2(D.Di)£ + D 2 = (£, exists; see §2), we have the inequality 

2D% > (£>.A)£ + 3D 2 Pi. 
Then there exists a curve on X containing all the S-integral points in X(k). 

Our proofs, though not effective in the sense of leading to explicit equations for the relevant curve, 
allow in principle quantitative conclusions such as an explicit estimation of the degree of the curve. Also, 
the bounds may be obtained to be rather uniform with respect to the field k; one may use results due to 
Schlickewei, Evertse (as for instance in the Remark in [CZ], p. 271) or more recent estimates by Evertse 
and Ferretti [EF]; this last paper uses the proof-approach to the Subspace Theorem due to Faltings and 
Wiistholz [FW], through the product theorem [F]. However here we shall not pursue in this direction. 

§1. Remarks and examples. In this section we collect several observations on the previous statements. 
Concerning Theorem 1, we start by pointing out that the condition on the (Di.Dj) cannot be removed. 

Example 1.1. Let X = P 1 x P 1 and let D 1 ,...,D 4 be the divisors {0} x P 1 , {oo} x P 1 , P 1 x {0} and 
P 1 x {oo} in some order. Then, defining X := X \ (u| =1 2?j), we see that X is isomorphic to the product of 
the affine line minus one point with itself. Therefore the integral points on X are (for suitable k, S) Zariski 
dense on X. (On the contrary, Theorem 1 easily implies that the integral points on P 2 minus four divisors 
in general position are not Zariski dense, a well-known fact.) 

Theorem 1 intersects results due to Vojta (also obtained through the Subspace Theorem); sec e.g. [V, 
Thms. 2.4.1, 2.4.6]; roughly speaking, these statements predict that the integral points are not Zariski dense, 
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provided there are sufficiently many components at infinity. However, they do not lead directly to the general 
case of Theorem f , as happens for instance when Pic°(X) is not trivial (like in the proof of Corollary 1), or 
when the rank of the subgroup generated by the Di in NS(X) is greater than 1. 

The conditions on the number of divisors Di and on the (Di.Dj) which appear in the Main Theorem 
(and in Theorem 1) come naturally from our method. One may ask how these assumptions fit with celebrated 
conjectures on integral points (see [HSi, Ch. F]). We do not have any definite view here; we just recall Lang's 
point of view, expressed in [L, p. 225-226]; namely, on the one hand Lang's Conjecture 5.1, [L, p. 225], 
predicts at most finitely many integral points on hyperbolic varieties; on the other hand, it is "a general 
idea" that taking out a sufficiently large number of divisors (or a divisor of large degree) from a projective 
variety produces a hyperbolic space. Lang interprets in this way also the results by Vojta alluded to above, 
(i) 

We now turn to Corollary 1, noting that in some sense its conclusions are best-possible. 

Example 1.2. Let a rational map -0 : C — > P 1 of degree 2 be given. We construct an affine subset C C C 
with four missing points and infinitely many quadratic integral points. Let B\, B 2 be distinct points in P 1 (fc) 
and define Y := P 1 \ {£>i, i^}- Lifting B\, Bi by ip gives in general four points A\, . . . , A4 E C. Define 
then C = C\ {A Xl . . . , A4}. Then ip can be seen as a finite morphism from C to Y. Lifting (the possibly) 
infinitely many integral points in Y{Og) by ip produces then infinitely many quadratic ^"-integer points on 
C (for a suitable finite S' D S). 

Concrete examples are obtained e.g. with the classical space curves given by two simultaneous Pell 
equations, such as e.g. t 2 — 2v 2 = 1, u 2 — 3v 2 = 1. We now have an affine subset of an elliptic curve, 
with four points at infinity. We can obtain infinitely many quadratic integral points by solving in Z e.g. 
the first Pell equation, and then defining u = V3u 2 + 1; or we may solve the second equation and then put 

t = \/2v 2 + 1; or we may also solve 3t 2 — 2u 2 = 1 and then let v = \J ^j^. (This is the construction of 
Example 1.2 for the three natural projections.) 

It is actually possible to show through Corollary 1 that all but finitely many quadratic integral points 
arise in this way. ^ We in fact have an additional property for the relevant maps in conclusion (ii), namely: 

Addendum to Corollary 1. Assume that ip is a quadratic map as in (ii) and that it sends to P 1 (/c) an 
infinity of the integral points in question. Then the set ip({Ai, . . . , A4}) has two points. In particular, we 
have a linear- equivalence relation Ylt=i e i(^i) ~ on Div(C), where the G {±1} have zero sum. 

When such a ip exists, the two relevant values of it can be sent to two prescribed points in P 1 (fc) by 
means of an automorphism of P ; in practice, the choice of the maps ip then reduces to splitting the four 
points at infinity in two pairs having equal sum in the Jacobian of C; this can be done in at most three ways, 
as in the example with the Pell equations. The simple proof for the Addendum will be given after the one 
for the corollary. This conclusion of course allows one to compute the relevant maps and to parametrize all 
but finitely many quadratic integral points on an affine curve with four points at infinity. 

Concerning again Cor. 1 (ii), we now observe that "r > 4" cannot be substituted with r > 3. 

Example 1.3. Let C — P 1 \ {—1,0, 00}, realized with the plane equation X(X + l)Y = 1. Let r,s run 
through the S- units in k and define a = , A = a 2 — r. Then the points given by x = a + y/A, 

1J = x ^ s +1 ^ , where x' — a — \/~A, are quadratic S- integral on C. It is possible to show that they cannot all 
be mapped to k by one at least of a finite number of quadratic maps. 

It is also possible to show that for the affine elliptic curve E : Y 2 = X 3 — 2, the quadratic integral points 
(over Z) cannot be all described like in (ii) of Corollary 1. 

Note that E has only one point at infinity. Probably similar examples cannot be constructed with more 
points at infinity; namely, (ii) is unlikely to be best-possible also for curves of genus g > 1, in the sense that 

W Both our method and Vojta's do not work at all by removing a single divisor (but see Ex. 1.4 below). 

( 2 ) On the contrary, the quadratic rational points cannot be likewise described; we can obtain them as 
inverse images from P : (fc) under any map of degree 2 defined over k, and it is easy to see that in general no 
finite set of such maps is sufficient to obtain almost all the points in question. 
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the condition r > 4 may be then probably relaxed. In fact, a conjecture of Lang and Vojta (see [HSi, Conj. 
F.5.3.6, p. 486]) predicts that if X = X\D is an affine variety with Kx + D almost ample (i.e. "big") and 
D with normal crossings, the integral points all lie on a proper subvariety. Now, in the proof of our corollary 
we work with X equal to & 2 \ the two-fold symmetric power of C, and with D equal to the image in 
of J2l=i A-i x C. It is then easily checked that Kx + D is (almost) ample precisely when g = and r > 4, 
or g — 1 and r > 2 or g > 2 and r > 1. In other words, the Lang- Vojta conjecture essentially predicts that 
counterexamples sharper than those given here may not be found. 

To prove this, one might try to proceed like in the deduction of Siegel's Theorem from the special case 
of three points at infinity. Namely, one may then use unramificd covers, as in [CZ], with the purpose of 
increasing the number of points at infinity. (One also uses [V, Thm. 1.4.11], essentially the Chevalley-Weil 
Theorem, to show that lifting the integral points does not produce infinite degree extensions.) 

In the case of the present Corollary 1 a similar strategy does not help. In fact, the structure of the 
fundamental group of & 2 ^ ( - 3 - ) prevents the number of components of a divisor to increase by pull-back on a 
cover. However there exist nontrivial instances beyond the case of curves, and showing one of them is our 
purpose in including this further result, namely: 

Example 1.4. Let A be an abelian variety of dimension 2, let ir : A — > A be an isogeny of degree > 4 and 
let E be an ample irreducible divisor on A. We suppose that for a <G ker7r no three of the divisors E + a 
intersect. Then there are at most finitely many S -integral points in (A \ n(E))(k). 

We remark that this is an extremely special case of a former conjecture by Lang, proved by Faltings [F, 
Cor. to Thm. 2]: every affine subset of an abelian variety has at most finitely many integral points. 

We just sketch a proof. Note now that tt(E) is an irreducible divisor, so Theorem 1 cannot be applied 
directly. Consider D := tt*(tt(E)); since tt has degree > 4, we see that D is the sum of r := dcg7r > 4 
irreducible divisors satisfying the assumptions for Theorem 1, with pi = 1 for i = 1, . . . , r. 

Let now E be an infinite set of ^-integral points in Y(k), where Y = A \ n(E). By [V, Thm. 1.4.11], 
7r _1 (E) is a set of S"-integral points on X(k'), where X = A \ D, for some number field k' and some finite 
set 5" of places of k' . By Theorem 1 applied to X we easily deduce the conclusion, since there are no curves 
of genus zero on an abelian variety ([HSi, Ex. A74(b)])./ / / 

We conclude this section by showing how the Main Theorem leads directly to Siegel's Theorem for the 
case of at least three points at infinity. (As remarked above, one recovers the full result by taking, when 
genus(C) > 0, an unramificd cover of degree > 3 and applying the special case and [V, Thm. 1.4.11].) 

Example 1.5. We prove: Let C be a projective curve and C — C \ {A\, . . . , A s }, s > 3 an affine subset. 
Then there are at most finitely many S-integral points on C. This special case of Siegel's Theorem appears 
as Theorem 1 in [CZ]. We now show how this follows at once from the Main Theorem. First, it is standard 
that one can reduce to nonsingular curves. We then let X = C x C and X = C x C. Then X \ X is the 
union of 2s divisors Di of the form Ai x C or C x Ai, which will be referred to as of the first or second 
type respectively. Plainly, the intersection product (Di.Dj) will be or 1 according as Di,Dj arc of equal 
or different types. We put in the Main Theorem r = 2s, pi = . . . = p r = 1. All the hypotheses are verified 
except possibly (ii). To verify (ii), note that (Di.Di) = 0, {D.D{) = s, D 2 = 2s 2 . Therefore & = s and we 
have to prove that 4s 3 > s 3 + 6s 2 which is true precisely when s > 2. 

We conclude that the S-integral points on C x C arc not Zariski dense, whence the assertion. 

§2. Tools from intersection theory on surfaces. We shall now recall a few simple facts from the theory 
of surfaces, useful for the proof of Main Theorem. These include a version of the Riemann-Roch theorem 
and involve intersection products. (See e.g. [H, Ch. V] for the basic theory.) 

Let X be a projective smooth algebraic surface defined over the complex number field C. We will follow 
the notations of [B] (especially Chapter 1), which are rather standard. For a divisor D on X and an integer 
i = 0,1, 2, we denote by h l {D) the dimension of the vector space H l (A, O(D)). We shall make essential use 
of the following asymptotic version of the Riemann-Roch theorem: 

( 3 ) Angelo Vistoli has pointed out to us that it is the abelianized of tt\{C). 
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Lemma 2.1 Let D be an ample divisor on X . Then for positive integers N we have 

n 2 d 2 

h°(ND) = ^^ + 0(N). 

Proof. The classical Riemann-Roch theorem (see e.g. Theoreme 1.12 of [B] and the following Remarque 1.13) 
gives 

h°(ND) = ^(ND) 2 - ^(ND.K) + X (O x ) + h\ND) - h°(K - ND), 

where K is a canonical divisor of X. The first term is precisely N 2 D 2 /2. Concerning the other terms, 
observe that: h x (ND) and h°(K — ND) vanish for large N; x(®x) is constant; the intersection product 
(ND.K) is linear in N. The result then follows. /// 

We will need an estimate for the dimension of the linear space of sections of ~H°(X, O(ND)) which have 
a zero of given order on a fixed (effective) curve C . We begin with a lemma. 

Lemma 2.2. Let D be a divisor, C a curve on X; then 

h°(D) - h"(D -C)< max{0, 1 + (D.C)}. 

Proof. In proving the inequality we may replace D with any divisor linearly equivalent to it. In particular, 
we may assume that |D| does not contain any possible singularity of C . 
Let us then recall that for every sheaf C the exact sequence 

-» C(-C) -» C -» C\C -» 

gives an exact sequence in cohomology 

-» E°(X,jC(-C)) -» B°(X,jC) -» ft°(C,£\C) -» . . . 

from which we get 

dim(H (!,£)/H (!,£(-C))) < dimH°(C,£|C). 
Applying this inequality with C — 0{D) we get 

h°(D) - h a (D -C)< dimH (C,O(L>)|C). 

The sheaf 0{D)\C is an invertible sheaf of degree (D.C) on the complete curve C. (See [B, Lemme 1.6], 
where C is assumed to be smooth; this makes no difference because of our opening assumption on \D\.) We 
can then bound the right term by max{0, 1 + (D.C)} as wanted./ / / 

Lemma 2.3. Let D be an ample effective divisor on X, C be an irreducible component of D. For positive 
integers N and j we have that either R°(X ,0(ND — jC)) — {0} or 

< h°(ND - jC) - h°(ND - (j + l)C) < N(D.C) - jC 2 + 1. 

Proof. Suppose first that (ND — jC.C) > 0. Then Lemma 2.2 applied with ND — jC instead of D gives 
what we want. If otherwise ND — jC has negative intersection with the effective curve C then 0(ND — jC) 
has no regular sections. In fact, assume the contrary. Then there would exist an effective divisor E linearly 
equivalent to ND - jC, whence E.C = (ND - jC.C) < 0. But E.C must be > 0. (In fact, since E is 
effective we may write E — E\ + rC, where E\ is effective and does not contain C and where r > 0. Then 
E.C = E\.C + rC 2 , whence the claim, in view of C 2 > 0, which in turn follows from (ND - jC.C) < 0). 
This contradiction concludes the proof./// 
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Lemma 2.4 Let D be an ample divisor, C be an effective curve. Then 

D 2 C 2 < {D.C) 2 . 

Proof. This is in fact well known (see e.g. [H, Ch. V, Ex. 1.9]). We give however a short proof for 
completeness. The inequality is non trivial only in the case C 2 > 0. Assume this holds. Then if we had 
D 2 C 2 > {D.C) 2 , the intersection form on the rank two group generated by D and C in Pic(X) would be 
positive definite, which contradicts the Hodge index theorem [H, Ch. V, Thm. 1.9]./// 

§3. Proofs. We shall begin with the proof of Main Theorem, actually anticipating a few words on the 
strategy. Then we shall deduce Theorem 1 from the Main Theorem. In turn, Theorem 1 shall be employed 
for the proof of Corollary 1. 

Proof of Main Theorem. We begin with a brief sketch of our strategy, assuming for simplicity that S consists 
of just one (archimedean) absolute value. In the case treated in [CZ], of an affine curve C with missing points 
A\, . . . , A r , r > 3, we first embed C in a high dimensional space by means of a basis for the space V of regular 
functions on C with at most poles of order N at the given points. Then, going to an infinite subsequence 
{Pi} of the integral points on C, we may assume that Pi — > A, where A is some Aj,. Linear algebra now 
gives functions in V vanishing at A with orders > — N, > —N + 1, . . . , > — N + d, where d = dimV. Such 
functions may be viewed as linear forms in the previous basis and these vanishings imply that the product 
of these functions evaluated at the Pi is small. Then the Subspace Theorem (recalled below) applies. 

The principles are similar in the present case of surfaces, the role of the points Ai being now played by the 
divisors Di. However one has to deal with several new technical difficulties. For instance, the construction of 
the functions with large order zeros is no longer automatic and the quantification involves now intersection 
indices. Moreover, additional complications appear when the integral points converge simultaneously to two 
divisors in the set, i.e. to some intersection point (this is "Case C" of the proof below). 

Now we go on with the details. We shall assume throughout that each of the divisors Di is defined over 

[*ti=Qp] 

k. Also, we assume that each valuation | • |„ is normalized so that if v\p, then \p\ v = p Foi , where k v is 
the completion of k at v, and similarly for archimedean v. As usual, for a point (x\ : . . . : Xd) € P d ~ 1 (k), 
(d > 2), we define the projective height as H{x\ : . . . : Xd) = Y[ v max(|xi|„, . . . , |ccd|^). 

The theorem will follow if we prove that for every infinite sequence of integral points on X , there exists 
a curve defined over k containing an infinite subsequence. In fact, arrange all the curves on X defined over k 
in a sequence C\,Ci, .... Now, if the conclusion of the theorem is not true, we may find for each n an integral 
point P n on X outside C\ U C2 U . . . U C„. But then no given curve C m can contain infinitely many of the 
points Pi. 

Let then {Pi}i G N be an infinite sequence of pairwise distinct integral points on X. By the observation 
just made, we may restrict our attention to any infinite subsequence, and thus we may assume in particular 
that for each valuation v € S the Pi converge w-adically to a point P v G X(k v ). 

We recall that D i} i = 1, . . . ,r, are certain irreducible divisors on X, and that we put D = Y^i=iP%^u 
where pi are positive integers (satisfying the hypotheses of the theorem). 

Fix a valuation v € S. We shall argue in different ways, according to the following three possibilities 
for P v . 

Case A: P v does not belong to the support \D\ of D. 

Case B: P v lies in exactly one of the irreducible components of |D|, which we call D v . 
Case C. P v lies in exactly two of the D^s, which we call D v , D". 

Note that our assumption that no three of the Di's share a common point implies that no other cases 
may occur. 

We fix an integer N, sufficiently large to justify the subsequent arguments. We then consider the 
following vector space V — Vn- 

V N = {tp€ k(X) : div((p) +ND> 0}. 
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Recall that we are assuming that each Di is defined over k, and in particular we may apply the results of the 
previous section. Since X is nonsingular, whence normal, each function in V is regular on X (by [H, Pro. 
6.3A]). Equivalently, V C k[X], i.e. every function in V is a polynomial in the affine coordinates. Let then 
ipi, . . . , ipd be a basis for V over k. (For large enough N , we may assume d > 2.) By the above observation, 
<Pj € so on multiplying all the ipj by a suitable positive integer, we may assume that all the values 

(Pj(Pi) lie in O s . 

For v G S, we shall construct suitable /c-linear forms Li„, . . . , in ipi, . . . , tpa, linearly independent. 
Our aim is to ensure that the product Ylj =1 \Lj v (Pi)\ v is sufficiently small with respect to the "local height" 
of the point (<pi(P»), . . . , (^(P))- 

More precisely, our first aim will be to show that, for a positive number fj, v and for all the points in a 
suitable infinite subsequence of {Pi}, we have 

Y[\L jv (Pi)\ v <£ max(|^(P,)k) , (3-1) 

j=i v J / 

where the implied constant does not depend on i. 

During this construction, where v is supposed to be fixed, we shall sometimes omit the reference to it, 
in order to ease the notation. 

In Case A, we simply choose Lj v = ipj. Since now all the functions ipj are regular at P v , they are 
bounded on the whole sequence Pj. Therefore 

d , 
Y[ \L jv (Pi)\ v < f maxd^^P)!,,) 

3=1 V 3 

where the implied constant does not depend on i, and so (3.1) holds with fj, v = 1. (Note that since the 
constant function 1 lies in V, not all the tpj can vanish at Pj.) 

We now consider Case B, namely the sequence {Pi} converges u-adically to a point P v lying in D v but 
in no other of the divisors Dj. Since X is nonsingular, we may choose, once and for all, a local equation 
t v = at P v for the divisor D v , where t„ is a suitable rational function on X. 

We define a filtration of V = Vjv by putting 

^ :={^ey| ord D ,M >j-l-Np v }, J = l,2,.... (3.2) 

Here we put p v = Pi, if f 11 is the divisor Di. Observe that in fact we have a filtration, since V = W\ D 
W2 D . . ., where eventually Wj = {0}. Starting then from the last nonzero Wj, we pick a basis of it and 
complete it successively to bases of the previous spaces of the filtration. In this way we shall eventually find 
a basis {ipi, . . . , ipd} of V containing a basis of each given Wj. 

In particular, this basis contains exactly dim(Wj/Wj+i) elements in the set Wj \ Wj+\; the order at D v 
of every such element is precisely j — 1 — Np v . Hence 

d 

5>rd D „(^-) = - 1 - N P V ) &MW 3 /W 0+1 ). (3.3) 

3=1 3>1 

Our next task is to obtain a lower bound for the right side. To do this it will be convenient to state separately 
a little combinatorial lemma. 

Lemma 3.1. Let d, U\, . . . ,Uh > and let R be an integer < h such that J2f=i Uj — d. Suppose further 
that the real numbers X\,...,Xh satisfy < Xj < Uj and Y^!j=i x j = ^- Then Y^!j=i J x j — SjLi jUj- 
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Proof. We have 

R h R R 

» + E ,:// + 1 - fa ^ E^ + E ,:/i ' + 1 - j>i 

fl R R 

< E ^ + E^ + 1 - 3)Uj = (R + 1)J2 U r 

But Ef=i(^ + 1 - j>j = (R+ l)d - Ef=i whcnce E?=i JSj > E?=i i^' + («+ l)(d - E?=i U 3 ) and 
the result follows since d — EjLi Uj > 0./// 

We shall apply the lemma, taking Xj := dhn.(Wj /Wj+i) and defining /i to be the number of nonzero 
Wj. Observe that Ej=i x j = dim V = d, consistently with our previous notation. Recall from the previous 
section (Lemma 2.1) that, for D as in the statement of the theorem, 

N 2 D 2 

d=— 2~ + 0(JV), (3.4) 

where the implied constant depends only on the surface X and on the divisor D. 

Further, let us define Uj = 1 + N(D.D V ) - jD v2 for j = l,...,h. Note that, by Lemma 2.3, < xj < Uj 
for j = l,...,h. 

Let £ denote the minimal positive solution of the equation 

D v2 £ 2 -2(D.D V )£ + D 2 = 0, 

so £ = £j if Z? 11 = Dj. Note that by Lemma 2.4 the solutions of this equation are real, and they cannot all 
be < because both D 2 and D.D V are positive (since D is ample). We also deduce that 

(D.D V ) > £D v2 . 

In fact, this is clear if D v2 < 0. Otherwise both roots must be positive, with sum 2 ^ D D ®i - ; and the assertion 
again follows since £ is the minimal root. 

We now choose A to be positive, < £ and such that 

X 2 (D.D V ) X 3 D v2 D 2 p" 



> 0. (3.5) 

This will be possible by continuity, in view of the assumption (ii) of the theorem, applied with & = £. In 
fact, by assumption wc have 2L> 2 £ > (D.D V )£ 2 + 3D 2 p v . 

Now, using the equation for £ we see that 2D 2 £ - (D.D V )£ 2 = 3(D.D V )£ 2 - 2D v2 ^. Therefore the 
previous inequality yields 3(D.D V )£ 2 - 2D v2 f - 3D 2 p v > 0. So (3.5) will be true for all A sufficiently near 
to £. 

Also, since A < £ we have, by definition of £, 

(D.D»)A--^-<— . (3.6) 

We shall apply Lemma 3.1, defining R = [XN]. We first verify that EjLi Uj < d for large enough N. 
In fact, we have 

Uj = RN(D.D") + 0(R + N) 

/ n v2 x 2 \ 

< N 2 (D.D V )X — + Q(N) 



and the conclusion follows from (3.4), since by (3.6) the number into brackets is < D 2 /2. 

Observe that, since < (D.D V ) - £D v2 < (D.D V ) - AD" 2 , we have Uj > for j < R, provided N 
is large enough. Thus, if we had R > h, the sum J2f=i Uj would be strictly larger than J2'j=i x j = d, a 
contradiction which proves that R < h. 

We may thus apply Lemma 3.1, which yields 

h R R 

The right side is N 3 (^1£-Eil _ + 0(1/W)) , so we obtain from £ x j = d > 

h lb, \ 

N~ 3 Y,U ~ 1 - Np v )xj > N~ 3 ^jxj - (Np v + l)d 

>^fl-^-^f + 0(l,N). 

By (3.5) the right side will be positive for large N; together with (3.3) this proves that, if N has been chosen 
sufficiently large, 

d 

5^ordz,.(^)>0. (3.7) 
Now, the functions tpj may be expressed as linear forms in the ipt. We then put Lj v = tpj. We have 

T. = +ord D «(^) 

where are rational functions on X, regular at P v . In particular, the values Pj V (Pi) are defined for large 
i and are w-adically bounded as Pi varies. Hence 



ny^ d ordnv (ib A 
\L ]V {Pi)\ v « MPoir 3=1 



By a similar argument, we have 



max|¥>j(Pj)|„ < |t„(Pi)|„ 
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Both displayed formulas make sense for all but a finite number of the points Pi, which we tacitly exclude. 
Then, the implied constants do not depend on i. 
From these inequalities we finally obtain 

d . 
n \Lj v (Pi)\v < f max | ipj- (Pi) | „ 

for some positive /z„ independent of i; therefore we have shown (3.1) in this case. This concludes our 
discussion of Case B. 

We finally treat Case C, namely the sequence {Pi} converges w-adically to a point P v e D v fl D%, 
where D V ,D^ are two distinct divisors in the set {Di, . . . , D r }. Similarly to the above, we denote by p v , pi 
the corresponding coefficients in D. 

By assumption, P v cannot belong to a third divisor in our set; let us choose two local equations t v = 
and t* = for D v ,Dl respectively. Here t v ,t* are regular functions, vanishing at P v ; also, since D v , D* are 
distinct and irreducible, t v and t* are coprime in the local ring of X at P v . 



We shall now consider two nitrations on the vector space V = Vjv, namely we put 



Wj := W G ^|ord D . (p) > j - 1 - JVp B }, 
:= W G ^|ord D;; (p) > j - 1 - A^}. 

The following lemma from linear algebra will be used to construct a suitable basis for V. 

Lemma 3.2. Let V be vector space of finite dimension d over a field k. Let V = W\ D W 2 D .. . D Wh, 
V = Wf D W 2 * D . . . D Wh* be two filtrations on V. There exists a basis ip\, . . . ,ipd of V which contains a 
basis of each Wj and each W* . 

Proof. We argue by induction on d, the case d = 1 being clear. Then we can certainly suppose (by refining 
the first filtration) that W 2 is a hyperplane in V. Put W[ := W* fl W 2 . By the inductive hypothesis there 
exists a basis tpi, . . . , tpd-i of W 2 containing basis of both W 3 , . . . , Wh and W[, . . . , W' h , . If all the W* for 
i = 2, . . . , h* are contained in W 2 , then W[ = W* for all i > 1; in this case we just complete {ipi, . . . , ipd-i} 
to any basis of V and we are done. Otherwise, let I be the minimum index with Wj* <jt W 2 ; in this case 
let ipd be any element in Wj* \ W 2 . We claim that the basis {ipi, . . . ,ipd} of V has the required property. 
Plainly it contains a basis of every W\, . . . , Wh- Let i be an index in {1, . . . , h*}; we shall prove that the set 
{V>i, ■ ■ ■ , V'd} contains a basis of W{. This is true by construction if i > I, because in this case W* — W(; if 
i < I, then the set {ipi, . . . , ipd] contains the element ipd G Wj* C W* and it contains a basis for W(, which 
is a hyperplane in W* ; hence it contains a basis of W* . 

Now, let ipi , . . . , ipd be a basis as in Lemma 3.2. Again, we define the linear forms Lj V in the ipg to 
satisfy Lj v = ipj. In analogy with Case B, we may write 

t . _ f.ord.Dv ipj . *ord D v 

where the pj v E k(X) are regular at P v ; so, as before, their values at the Pi arc defined for large i and 
w-adically bounded as i oo. Here we have used the fact that P v is a smooth point, so the corresponding 
local ring is a unique factorization domain; in particular if a regular function is divisible both by a power of 
t v and a power of t* (which are coprime), it is divisible by their product. 
Then we have 

where the implied constant does not depend on i. 

Again, from the assumption (ii) applied to D v and D%, the same argument as in Case B gives the 
analogue of (3.7), both for X^=i ordo-oipj and for ordzj^j. Hence, as before, we deduce (3.1). 

In conclusion, we have proved that (3.1) holds for all v e S, for suitable choices of fi v > 0. Also, the 
function constantly equal to 1 lies in V, so is a linear combination of the ifj, so max.\ipj(Pi)\ v >• 1. Thus, 
letting /j, := min^s /j, v > 0, we may write 

d , v -n 

Y[ \L jv (Pi)\ v < ( max.\(pj(Pi)\ v J , veS. 

Our theorem will now follow by a straightforward application of the Subspace Theorem. We recall for 
the reader's convenience the version we are going to apply, equivalent to the statement in [S, Thm. ID', p. 
178]. 

Subspace Theorem. For an integer d > 2 and v G S, let L\ v , . . . , Ldv be independent linear forms in 
Xi, . . . , Xd with coefficients in k, and let e > 0. Then the solutions (x\, . . . , Xa) € 0$ of the inequality 

d 

[I II \ L i v ( Xl > ■■■i x d)\v< H^^xx : ... :x d ) 
ves j=i 
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lie in the union of finitely many proper linear subspaces of k d . 

We apply this theorem by putting (x\, . . . ,Xd) = (pi(Pi), . . . , pd(Pi))- We may assume that H{x\ : . . . : 
Xd) tends to infinity as i — > oo, for otherwise the projective points (x\ : . . . : xj) would all lie in a finite set, 
whence the nonconstant function pi/p2 would be constant, equal say to c, on an infinite subsequence of the 
Pi. In this case the theorem follows, since infinitely many points would then lie on the curve defined on X 
by tpx - cp 2 = 0. 

But then for large i the {x\, . . . ,Xd) satisfy the inequality in the statement of the Subspace Theorem, 
by taking for example e = /i/2. We may then conclude that some nontrivial linear relation cipi(Pi) + 
. . . + Cdfd(Pi) = 0, with fixed coefficients ci, . . . , c<j, holds on an infinite subsequence of the Pi. Again, the 
theorem follows since the <pj are linearly independent./ / / 

Proof of Theorem 1. We let pi, . . . ,p r be positive integers as in the statement, namely there exists a positive 
integer c such that PiPj(Di.Dj) = c for 1 < i, j < r. We have only to check that the assumptions (i), (ii) for 
the Main Theorem are verified with this choice for the Pi. 

Assumption (i) actually appears also in the present theorem. To verify (ii) note that 

(D.Di) = — , D 2 =r\ D\ = 4, 
Pi pf 

and it follows that & = rpi. Hence inequality (ii) amounts to 2r 3 cpi > r 3 cpi + 3r 2 cpi which is equivalent to 
r > 4. This concludes the proof./ / / 

Proof of Corollary 1. We start with a few reductions. First, by Siegel's Theorem we may assume that, given 
a number field fc', only finitely many of the points in question are defined over k' . Next, note that we may 
plainly enlarge S without affecting the conclusion and we now prove that also k may be enlarged; namely, 
it suffices to show in (ii) that all but finitely many quadratic integral points over k are mapped to P 1 (k') by 
one at least of finitely many rational maps tp G k'(C) of degree 2, where k! is a finite extension of k. 

To prove this claim, assume the last statement; we shall deduce the Corollary from it. Conclusion (i) 
of the corollary remains unaltered. We now show (ii); take one of the maps as in the assumed conclusion. 
We may assume that it sends to P 1 (k') the quadratic S'-integral points in an infinite set S. Note that the 
coordinate functions Xi in k[C], i = 1, . . . , m, satisfy by assumption quadratic equations Xf + aiXi + 6^ = 0, 
where at, bi are rational functions of tp; by enlarging k', we may then assume that a i7 bi G k'(ip). By adding 
new coordinates expressed as linear combinations of the original ones, if necessary, the equations show that 
k'(C) has degree < 2 over fc'(a 1; 6 1; . . . , a m , b m ). This last field is contained in k'(ip), and \k'{C) : k'(ijj)] = 2 
by assumption; so k'(ip) = k'{a\, b\, . . . , a m , b m ). 

By the opening remark only finitely many of the points in £ can be defined over k'; in the sequel we 
tacitly disregard these points. By taking suitable linear combinations (over k) of the coordinates, we may 
then assume that for all points P e S and all i = 1, ...,m, Xi(P) k' . Evaluating the equations at 
P e S we obtain Xi(P) 2 + a l (P)Xi(P) + bi(P) = 0. Note that both a i (P),6 J (P) lie in k', since we are 
assuming that tp sends S in k'. Therefore the same equations hold by replacing Xi(P) with its conjugate 
over k: in fact we are assuming that Xi(P) arc quadratic over k, but do not lie in k', and this implies that 
Xi(P) are of exact degree 2 over k! . But then we see that ai(P),bi(P) actually lie in k. Consider the field 
L = k{a\,b\, . . . , a m , b m ). Since L C k'(tp), we see that L is the function field of a curve over k, possibly 
reducible over k! . This curve however has the infinitely many fc-rational points obtained by evaluating the 
ai,bi at P, for P G S. Therefore the given curve is absolutely irreducible and of genus zero and now the 
existence of fc-rational points gives L = k(<p) for a certain function tp G k'(ip). Since ai(P),bi(P) G fc, we 
have <p{P) G fc for P G S. Now, C is absolutely irreducible, so fc is algebraically closed in fc(C). Therefore 
[fc(C) : k(<p)] = [fc'(C) : k'(tp)] — 2, since k'(ip) = k'(ip). Therefore the function p may be used instead of tjj 
to send to P 1 (fc) (rather than P 1 (fc')) the points in S. 

We continue by observing that the integral points on C lift to integral points of a normalization, at 
the cost of enlarging fc and S. Therefore, in view of what has just been shown, we may assume that C is 
nonsingular. 
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We shall then apply Theorem 1 to the surface X = defined as the symmetric product of C with 
itself. (We recall from [Se2, III. 14] that X is in fact smooth.) Then we have a projection map 7r : C x C — > X 
of degree 2. 

We let Di, i = 1, . . . , r, be the image in X under n of the divisor Ai x C C C* x C . 

That the Z)j intersect transversely, and that no three of them share a common point follows from the 
corresponding fact onCxC. Also, note that each is ample on X, as follows e.g. from the Nakai-Moishezon 
criterion. A fortiori, we have that D\ + . . . + D r is ample. Define X := X \ (D\ U . . . U D r ); then X is affinc 
and we may fix some affine embedding. (That the symmetric power of an affine variety is affine follows also 
from a well-known result on quotients of a variety by a finite group of automorphisms; see for instance [Bo, 
Prop. 6.15].) 

Note that tt restricts to a morphism from C x C in X. 

Let now {Pi} be a sequence of S'-integral points on C, such that Pi is defined over a quadratic extension 
ki of k. Letting P[ £ C(fcj) be the point conjugate to Pi over k, we define Qi := (Pj,P/) e C x C and 

Ri ■■= v(Qi) e X(h). 

Observe that Ri £ X(k). In fact, for any function ip £ k(X), we have that ip* — ip o it is a symmetric 
rational function on C x C (that is, invariant under the natural involution of C x C). Therefore <p(Ri) — 
ip*(Pi,P() — ip*(P( 7 Pi)- This immediately implies that <p>(Ri) is fixed by the Galois group Galik/k), proving 
the claim. 

Further, we note that for any ip £ k[X], there exists a positive integer m — m v such that all the values 
rrvp(Ri) are S-integers. In fact, note that ip* is regular on C x C, that is ip* £ k[C x C] = k[C] <X>fc k[C\; this 
proves the contention, since for any function ip £ k[C], the values ip(Pi),ip(P-) differ from S-integers by a 
bounded denominator, as i varies. 

In particular, this assertion holds taking as ip the coordinate functions on X. So, by multiplying such 
coordinates by a suitable positive integer (which amounts to apply an affine linear coordinate change on X) 
we may assume that the Ri are integral points on X. 

We go on by proving that the assumptions for Theorem 1 are verified in our situation. 

Note that the pull-back of Dj in C x C is given by w*(D i ) = A t x C + C x Ai. Since every two points on 
a curve represent algebraically equivalent divisors, we have that the same holds for the ir*(Di). In particular, 
they are numerically equivalent, so the same is true for the D t . Since we plainly have (7r*(_Di).7r*(_D 2 )) = 2, 
it follows that (Di.Dj) = 1 for all pairs i,j ([B, Prop. 1.8]). 

In conclusion, we have verified the assumptions for Theorem 1, with r > 4 and pi = . . . = p r = c = 1. 

From Theorem 1 we deduce that the Ri all lie on a certain closed curve Y C X. To prove our assertions 
we may now argue separately with each absolutely irreducible component of Y. Therefore we assume that 
the Ri are contained in the absolutely irreducible curve Y, defined over a number field containing k. Since 
Y contains the infinitely many points Ri, all defined over k, it follows that Y is in fact defined itself over k. 
Also, Y must have genus zero and at most two points at infinity, because of Siegel's Theorem. In the sequel 
we also suppose, as we may, that Y is closed in X and we let Y be the closure of Y in X and Z = 7r^ 1 (l r ), 
Z = n-\Y) = Z\(UU 1 n*(D l )). 

Assume first that r > 5. Then, since Z is complete at least one of the natural projections on C 
is surjective, whence # {z n (U[ =1 7r*(A))) > 5, and therefore Z \ Z > 5. Hence #(Y \ Y) > 3, since 

#7r~ 1 (i?) < 2 for every R £ X. But then Siegel's Theorem applies to Y and contradicts the fact that Y has 
infinitely many integral points. This proves part (i). 

From now on we suppose that r = 4. 

The case when C is rational can be treated directly, similarly to Example 1.3 above, even without 
appealing to the present methods. By extending the ground field and S, C may be realized as the plane 
quartic (X — A)(A 2 — 1)Y = 1, where A £ k is not ±1. Let (x,y) be a quadratic S'-integral point on C. 
Denoting the conjugation over k with a dash, we have that (x — X)(x' — A) =: r, (x — l)(x' — 1) =: s, 
(x + l)(x' + l)=:t are all S-units in k. Eliminating x, x' gives 2r - (A + l)s + (A - l)t = 2(A 2 - 1) ^ 0. By 
S-unit equation-theory, as in [S2, Thm. 2A] or [V, Thm. 2.3.1], this yields some vanishing subsum for all 
but finitely many such relations. Say that e.g. t = 2(A + 1), 2r = (A + l)s, the other cases being analogous. 
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This leads to x + x' = A + 1 - f , xx' = A + § , whence x 2 - (A + 1 - § )x + A + § = 0, i.e. r = 2(* 2 -(*+!)*+*) . 
Then the map given by m 2 ^ x ~(j v +i) a: + A ) satisfies the conclusion. 

Suppose now that C has positive genus and view C as embedded in its Jacobian J. For a generic point 
R £ Y, let {(P, Q), (Q, P)} = tt-^P) e Z. Then P^P + QeJ is a well-defined rational map from Y 
to J. But Y is a rational curve, and it is well-known that then such a map has to be constant ([HSi, Ex. 
A74(b)]), say P + Q = c for 7r(P, Q) = R £ Y, where c is independent of R. We then have a degree 2 regular 
map V : C -> V defined by V(P) = tt((P, c - P)). It now suffices to note that tp(Pi) = 7r((P, P/)) = Pi is 
an .S-integral point in Y(k). 

Proof for the Addendum. Let ip be one of the mentioned maps, and let {PijieN be an infinite sequence of 
distinct quadratic integral points on C such that ip(Pi) £ k. We have equations Xf + a%Xi + bi = 0, where 
cii,bi £ k(ip). By changing coordinates linearly, we may assume, as in the argument at the beginning of the 
proof of the Corollary 1, that k(C) is quadratic over k(ai,b\, . . . , a m ,b m ) and that all the values at the Pi 
of the affine coordinates X\, . . . , X m are of exact degree 2 over k. Then aj(Pi), bj(Pi) are S- integers in k, 
for all i, j in question. The rational map : P t-^> (ai(P), &i(P), ■ • • , a m (P) , b m (P)) , from C to P 1 , sends 
C to an affine curve Y (over k) with infinitely many 5-integral points over k. This curve, whose affine ring 
is k[Y] = fc[ai, &i, . . . , a m , b m ], can have at most two points at infinity, by Siegel's Theorem. On the other 
hand, the above quadratic equations for the coordinates imply that k[C] is integral over k[Y], whence all of 
the (four) points at infinity of C correspond to poles of some at or 6j. Therefore the a\, b\, . . . , a m , b m have 
altogether at least the four poles A\, ...,A± on C, and so they have at least the poles ^(^i), . . . , ^(Ai), 
viewed as rational functions of ip. But the above rational map (p has degree 2, whence -0 factors through it, 
namely k(tp) = k(Y). Therefore the curve Y has at least #{ip(Ai), . . . ,\j){A±)} points at infinity. By the 
above conclusion this cardinality is at most two, proving the first contention of the addendum. 

As to the second, say that ip(Ai) — f/K^) =: a and ^(^3) = ip(Ai) =: (3. Then has divisor ^ 
(Ai) + (A 2 ) — (A 3 ) — (A4), yielding a relation of the mentioned type among the (A)./ / / 

The authors thank Professors Enrico Bombieri, Barbara Fantechi, Angelo Vistoli and Paul Vojta for 
several very helpful discussions and comments. 
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